An Ontological Framework for Retrieving Environmental Sounds Using Semantics and Acoustic Content

نویسندگان

  • Gordon Wichern
  • Brandon Mechtley
  • Alex Fink
  • Harvey D. Thornburg
  • Andreas Spanias
چکیده

Organizing a database of user-contributed environmental sound recordings allows sound files to be linked not only by the semantic tags and labels applied to them, but also to other sounds with similar acoustic characteristics. Of paramount importance in navigating these databases are the problems of retrieving similar sounds using textor sound-based queries, and automatically annotating unlabeled sounds. We propose an integrated system, which can be used for text-based retrieval of unlabeled audio, content-based query-by-example, and automatic annotation of unlabeled sound files. To this end, we introduce an ontological framework where sounds are connected to each other based on the similarity between acoustic features specifically adapted to environmental sounds, while semantic tags and sounds are connected through link weights that are optimized based on userprovided tags. Furthermore, tags are linked to each other through a measure of semantic similarity, which allows for efficient incorporation of out-of-vocabulary tags, that is, tags that do not yet exist in the database. Results on two freely available databases of environmental sounds contributed and labeled by nonexpert users demonstrate effective recall, precision, and average precision scores for both the text-based retrieval and annotation tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Vibrotactile Identification of Signal-Processed Sounds from Environmental Events Presented by a Portable Vibrator: A Laboratory Study

Objectives: To evaluate different signal-processing algorithms for tactile identification of environmental sounds in a monitoring aid for the deafblind. Two men and three women, sensorineurally deaf or profoundly hearing impaired with experience of vibratory experiments, age 22-36 years. Methods: A closed set of 45 representative environmental sounds were processed using two transposing (TRH...

متن کامل

Shortest Path Techniques for Annotation and Retrieval of Environmental Sounds

Many techniques for text-based retrieval and automatic annotation of music and sound effects rely on learning with explicit generalization, training individual classifiers for each tag. Non-parametric approaches, where queries are individually compared to training instances, can provide added flexibility, both in terms of robustness to shifts in database content and support for foreign queries,...

متن کامل

A Framework for Business Intelligence Application using Ontological Classification

Every business needs knowledge about their competitors to survive better. One of the information repositories is web. Retrieving Specific information from the web is challenging. An Ontological model is developed to capture specific information by using web semantics. From the Ontology model, the relations between the data are mined using decision tree. From all these a new framework is develop...

متن کامل

Storing and Retrieving Software Components: A Component Description Manager

The aim of the paper is to present the results of research into Component-Based software development by providing a specification mechanism allowing searching for components in a component repository. A new component classification framework is proposed based on which a Component Description Manager has been designed and implemented. The classification framework combines domain knowledge, ontol...

متن کامل

An Ontological Approach to the specification of Semantics for Learning Content. The convergence of knowledge management and technology enhanced learning

The PhD Thesis is concentrating in the convergence of knowledge management and technology enhanced learning towards the effectiveness in the design and exploitation of learning content. The main emphasis is paid to the modelling of the learning content development process and through ontological considerations the thesis contributes in theory and practice as follows: It proposes a Life Cycle mo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • EURASIP J. Audio, Speech and Music Processing

دوره 2010  شماره 

صفحات  -

تاریخ انتشار 2010